Aller au contenu principal

Data Types

image

Structured Data

Structured data is data that is organized in a tabular format with rows and columns. Each column represents a specific attribute, and each row represents a single record.

Structured data is highly organized and easily searchable. It is typically stored in relational databases and can be queried using SQL.

Examples:

  • CSV
  • Excel
  • SQL databases (MySQL, PostgreSQL, Oracle)
  • Google Sheets
  • Parquet
  • ...

Semi-Structured Data

Semi-structured data is data that does not fit into a tabular format but has some organizational properties that make it easier to process.

Semi-structured data is flexible and adaptable. It can be stored in NoSQL databases or document-oriented databases.

Examples:

  • JSON
  • XML
  • YAML
  • Log files
  • Email
  • ...

Unstructured Data

Unstructured data is data that does not have a predefined format or structure. It can be text-heavy and difficult to analyze using traditional methods.

Unstructured data is complex and requires advanced processing techniques such as natural language processing (NLP) or machine learning.

Examples:

  • Text files
  • Images
  • Audio files
  • Video files
  • Social media posts
  • ...